NSF PAR Search | NSF Public Access Repository

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning

Wang, Yaxuan; Liu, Quan; Liu, Chris Yuhao; Pang, Jinlong; Wei, Wei; Bao, Yujia; Liu, Yang (July 2025, ICML 2025 Workshop on Machine Unlearning for Generative AI)

Full Text Available
Economical and Versatile Subunit Design Principles for Self-Assembled DNA Origami Structures

https://doi.org/10.1021/acsnano.5c06681

Wei, Wei-Shao; Videbæk, Thomas E; Hayakawa, Daichi; Saha, Rupam; Pombo, Juanita; Arya, Gaurav; Rogers, W Benjamin; Fraden, Seth (September 2025, ACS Nano)

We describe a modular design approach for creating versatile DNA origami subunits that can target diverse self-assembled structures. The subunit consists of a constant “core module” with variable “bond modules” and “angle modules” added to its exterior, controlling interaction specificity, strength, and structural geometry. The design features flexible joints between subunits, implemented by using single-stranded angle modules, whose mechanical properties and possible conformations are characterized by cryogenic electron microscopy and coarse-grained molecular modeling. We demonstrate the design’s versatility through the assembly of structures with different Gaussian curvature, including sheets, spherical shells, and tubes. Our findings suggest that incorporating a judicious amount of flexibility in the bonds provides error tolerance in design and fabrication while maintaining target fidelity. Furthermore, off-target assemblies potentially introduced by flexibility can be counterbalanced by increasing the number of distinct bonds. This approach enables precise targeting of specific structural binding angles across a broad range of configurations by eliminating unfavorable interactions.
Full Text Available
Numerical analysis of a 1/2-equation model of turbulence

https://doi.org/10.1016/j.physd.2024.134428

Han, Wei-Wei; Fang, Rui; Layton, William (January 2025, Physica D: Nonlinear Phenomena)

Full Text Available
Improving Data Efficiency via Curating LLM-Driven Rating Systems

Pang, Jinlong; Wei, Jiaheng; Shah, Ankit; Zhu, Zhaowei; Wang, Yaxuan; Qian, Chen; Liu, Yang; Bao, Yujia; Wei, Wei (April 2025, The Thirteenth International Conference on Learning Representations)

Instruction tuning is critical for adapting large language models (LLMs) to downstream tasks, and recent studies have demonstrated that small amounts of human-curated data can outperform larger datasets, challenging traditional data scaling laws. While LLM-based data quality rating systems offer a cost-effective alternative to human annotation, they often suffer from inaccuracies and biases, even in powerful models like GPT-4. In this work, we introduce DS2, a Diversity-aware Score curation method for Data Selection. By systematically modeling error patterns through a score transition matrix, DS2 corrects LLM-based scores and promotes diversity in the selected data samples. Our approach shows that a curated subset (just 3.3% of the original dataset) outperforms full-scale datasets (300k samples) across various machine-alignment benchmarks, and matches or surpasses human-aligned datasets such as LIMA with the same sample size (1k samples). These findings challenge conventional data scaling assumptions, highlighting that redundant, low-quality samples can degrade performance and reaffirming that "more can be less."
Full Text Available
LLM Unlearning via Loss Adjustment with Only Forget Data

Wang, Yaxuan; Wei, Jiaheng; Liu, Chris Yuhao; Pang, Jinlong; Liu, Quan; Shah, Ankit; Bao, Yujia; Liu, Yang; Wei, Wei (April 2025, Thirteenth International Conference on Learning Representations)

Full Text Available
An improved FLARE system for recording and manipulating neuronal activity

https://doi.org/10.1016/j.crmeth.2025.101012

Zhou, Guanwei; Li, Ruonan; Bartolik, Ola; Ma, Yuqian; Wan, Wei Wei; Meng, Jennifer; Hu, Yujia; Ye, Bing; Wang, Wenjing (April 2025, Cell Reports Methods)

Full Text Available
Engineering a cell-based orthogonal ubiquitin transfer cascade for profiling the substrates of RBR E3 Parkin

https://doi.org/10.1016/j.isci.2025.112913

Fang, Shuai; Zhou, Li; Chen, Geng; Zhang, Jing; Wang, Xiaoyu; Jeong, In Ho; Jacobs, Savannah E; Kossmann, Bradley R; Wei, Wei; Liu, Shu; et al (July 2025, iScience)

Full Text Available
